Large Scale Integration, Analysis, and Visualization of Biological Data
Author: Vishal Rajesh Patel
Abstract
Dissertation by Vishal Rajesh Patel, Doctor of Philosophy in Computer Science, University of California, Irvine, 2014. Professor Pierre Baldi, Chair.

Data from decades of life sciences research and literature is being curated and made available for searching and analysis. While considerable work has been done to integrate and re-use this data, what is still lacking is a unifying platform that allows new experimental data to leverage all previously published data effectively. Crick is an intelligent and scalable platform for data integration, visualization, and searching for meaningful biological hypotheses. It was built to integrate and analyze experimental data effectively in the context of the vast literature and other biologically relevant information. Crick was designed from the ground up to address some of the most challenging problems in biological data, such as entity resolution, size, scale, reliability of the data, and visualization of high-dimensional information. Crick has been used successfully to identify molecular mechanisms regulating circadian metabolism; to understand the complex coupling of circadian oscillating species; to study pediatric cancers; to analyze the dynamic long-range interactions in the genome; and more.
Similar resources
Analysis and Visualization of Gene Expressions and Protein Structures
This paper describes a web-based interactive framework for the analysis and visualization of gene expressions and protein structures. Formulating the proposed framework posed many challenges because of the wide range of relevant analysis and visualization techniques and the diversity of biological data types on which these techniques operate. The main...
BiologicalNetworks: visualization and analysis tool for systems biology
Systems level investigation of genomic scale information requires the development of truly integrated databases dealing with heterogeneous data, which can be queried for simple properties of genes or other database objects as well as for complex network level properties, for the analysis and modelling of complex biological processes. Towards that goal, we recently constructed PathSys, a data in...
Semantics for Big Data Integration and Analysis
Much of the focus on big data has been on the problem of processing very large sources. There is an equally hard problem of how to normalize, integrate, and transform the data from many sources into the format required to run large-scale analysis and visualization tools. We have previously developed an approach to semi-automatically mapping diverse sources into a shared domain ontology so that ...
Network analysis approach for biology.
The biological system is a complex physicochemical system consisting of numerous dynamic networks of biochemical reactions and signaling interactions between cellular components. This complexity makes it virtually unanalyzable by traditional methods. Hence, biological networks have been developed as a platform for integrating information from high- to low-throughput experiments for analysis of ...
Secondary Use of Laboratory data: Potentialities and Limitations
Clinical databases have been developed in recent years across all areas of medicine, including laboratory results. The information produced by diagnostic laboratories has a great impact on the health care system, with various secondary uses. These uses sometimes take the form of publishing newly extracted information from laboratory reports, which have been widely applied in the scientifi...